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Abstract. The distribution of nearby (cz < 7500 km s -1 ) galaxies has been 
explored by minimal spanning trees; allowance has been made for the drastic 
decrease of data with distance. The investigation finds that all galaxies are 
members of irregular elongated large-scale structures; there are no 'field galax- 
ies'. Based on our local large-scale structure, every galaxy appears to have a 
neighbouring galaxy within <100 km s _1 (1.4 K^q Mpc) of redshift space, and 
thereby all galaxies are found to lie in filaments or tree configurations. 

Every large-scale structure appears to have a neighbouring large-scale struc- 
ture within <700 km s _1 (10 h^ Q Mpc), such that large-scale structures inter- 
connect to form a continuous labyrinth. 



1. Introduction 

The texture displayed by the Universe today is one of a labyrinth of large-scale 
structures and voids - first described as 'cellular' by Joeveer and Einasto (1978). 
It is this labyrinth of large-scale structures and voids - the "Cosmic web" - that 
is the focus of much of this meeting (and appropriate that pioneers such as Jaan 
Einasto and John Huchra are present). 

From mappings of local large-scale structures (e.g. Fairall 1998, hereafter 
F98), we now know that the voids range in size up to 85 h^Q Mpc (6000 km s _1 
in redshift space); the closest such large void being the well-known Sculptor Void 
(R.A. 0.5h, Decl. -35deg, cz = 5500 km s _1 ). Voids have a tendency towards 
sphericity; they have often been described as bubble-like. Like the bubbles in a 
bath sponge, they also interconnect (on the basis that interconnecting holes are 
larger than the correlation length of the galaxies surrounding the voids - e.g. 
Gott et al. 1986). Were it possible, one could travel throughout the universe, 
passing from void to void. The most clearly defined voids (e.g. Sculptor, Mi- 
croscopium) are empty, or have substantial empty cores. Other voids look as if 
they were formed from mergers of smaller voids, with the remnants of previous 
boundaries still apparent. 

However, within the research community, there does not seem to be con- 
sensus as to how voids should be defined. Some researchers see them as truly 
'void', others as merely under-densities. Our choice of a working definition for 
voids is simple: they are empty (Fairall et al. 1991). As the Local Void so clearly 
demonstrates, there is no population of low luminosity galaxies filling the voids. 
In the past, we have used a void-finding algorithm that fitted empty, approx- 
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imately spherical, progressively smaller, voids into the distribution of galaxies 
(Kauffmann & Fairall 1991). Where voids appear to have resulted from mergers, 
then multiple empty voids are fitted to fill the cavity 

Large-scale structures are gigantic conglomerations of galaxies. They have 
irregular shapes, since they border on the bubbly voids, or even incorporate 
small voids or holes. However there is a tendency to be flattened (walls) or 
ribbon-like. Large-scale structures interconnect with one another to form the 
labyrinth. 

The galaxies within the large-scale structures tend to lie in chains or fila- 
ments. They wrap around the voids in a fashion similar to the filaments that 
wrap around the hollow interior of the Crab Nebula. It is this tendency that 
has given rise to the texture being described as the 'Cosmic Web' (e.g. Lahav, 
Karachentsev, these proceedings). 

Is there a way to define a large-scale structure? The first author (F98) has 
proposed a working definition that large-scale structures are formed by conglom- 
erations of galaxies, where galaxy-galaxy separations are less that 200 km s _1 
(~ 3 hjQ Mpc). However, we shall show below that, once a reasonably complete 
sample is used, the separations may even be lower than 100 km s _1 (1.4 
Mpc). 

The definition is based on the idea that every galaxy has a neighbour, the 
impression gained by the first author (when preparing the maps for the Atlas 
of Nearby Large-Scale Structures - F98), particularly for very nearby galaxies, 
where data were most complete. A further impression gained was that there 
might not be isolated islands of galaxies. Similarly, all large-scale structures 
might interconnect and none be isolated. If so, all the high density regions, in 
the local Universe at least, would be linked, and one could travel around in the 
universe by keeping to high density regions (again much like the material that 
makes a bath sponge). In Section 3, we attempt to quantify this situation by 
means of Minimal Spanning Trees. 

2. Data and incompleteness 

Given the enormous success of the 2dF and Sloan Digital Sky Surveys, and 
their predecessors, many researchers might consider their databases appropriate 
for investigating the characteristics of large-scale structures. What is not often 
appreciated is that such traditional (magnitude-limited) redshift surveys sample 
at best only the brightest few galaxies of several hundred, and possibly only 
the brightest few galaxies of several thousand. This will be apparent when we 
examine local data below. 

Ideally, what we need to test the 'neighbour' hypothesis is a volume-limited 
sample - i.e. a sample volume of space where all galaxies that exist within 
that volume are known. Unfortunately there is no such thing as a volume- 
limited sample, short perhaps of our local group of galaxies, and even there, 
new members are still discovered from time to time (such as the recent finding 
of Andromeda VIII, Morrison et al. 2003). The Catalogue of Nearby Galaxies, 
announced at this meeting by Karachentsev, is probably the closest approxima- 
tion we have to a volume-limited sample. Unfortunately, it comes too late for 
the present investigation, but will be used in a follow up. 
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What is the closest approximation to a volume-limited sample for nearby 
large-scale structures? The answer is to use all-available redshifts. For this rea- 
son we have turned to the NASA/IPAC Extragalactic Database (NED). Critics 
may argue that this is a statistically uncontrolled sample. Admittedly so, but 
all catalogues suffer at various levels from incompleteness (some are highly in- 
complete). In this light, the best one can do is to use the highest number of 
redshifts available. Furthermore, by restricting our investigation to very shal- 
low redshifts, there is good all-sky coverage, without major selection effects in 
direction. 

The data used below contains all available redshifts with cz < 7500 km s _1 , 
as extracted from NED in April 2003. Entries for galaxy pairs and groups, as well 
as galaxies with blueshifts, have been excluded. The data has been supplemented 
by the recent release of 6dF data (Read et al. 2002), but with targets having 
cz < 200 km s _1 excluded as they were more likely to be galactic. The combined 
database provides a sample of over 35 000 galaxies. 

Astonishingly, from the core (cz < 250 km s _1 ) to the boundary of the 
sample volume (cz = 7500 km s -1 ), the number density of galaxies declines by 
a factor of more than 300 times. In part this is due to the core lying within a 
large-scale structure, while local voids are incorporated in the complete sample 
volume; this could account for a factor of ~6 times. The balance - a factor of 
some fifty times - mainly reflects the increasing incompleteness with distance, 
even though a redshift of cz = 7500 km s _1 would be considered extremely 
modest compared to modern redshift surveys. 

Because of this severe incompleteness problem, our approach here has been 
to work within spherical volumes of increasing radii - from a radius of cz = 250 
km s -1 out to the 7500 km s -1 limit - and to monitor the trends behind measured 
parameters. 

3. Minimal spanning trees 

Minimal spanning trees (hereafter MSTs) allow structures or sub-structures to 
be identified, according to a specified (maximum) percolation distance (e.g. 
Bhavasar Sz Splinter 1996, Krzewina & Saslaw 1996). They form the ideal tool 
for the current investigation. 

The data have been analysed by means of software that progressively creates 
ever larger MSTs. The program, written by one of us (D.T) first passes through 
the data to determine all separations between galaxies. It then works with ever 
increasing percolation distance - starting from zero - to build up MSTs. Initially 
numerous separate trees form, and grow as the percolation distance increases. 
In time, as the percolation distance increases still further, the trees begin to 
merge. If run to the ultimate conclusion, all trees merge into a single MST. 

This scenario is apparent from Figure 1, which shows the number of MSTs 
within a certain data set as a function of percolation distance. The tabular data 
from which that figure is prepared also records the number of galaxies involved 
in the tree structures. 

One version of the program allows the process to be visualised, so that 
one can see the trees build up and merge (as demonstrated during the oral 
presentation of this paper). That version also allows the programme to be 



4 



Fairall et ah 



halted at any point. A further aspect of the software is that it allows the viewer 
to fly around the data and its MSTs (and can even be viewed in 3D). Figure 2 
shows an example of the visualisation. 
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Figure 1. As the percolation distance increases from zero, so the number 
of MSTs in the data set increases. A maximum is reached, after which trees 
merge and the number declines. Eventually all the data is united as a single 
tree. In this case the data set out to cz = 1500 km s _1 has been used. 





Figure 2. Minimal spanning trees to 125 km s _1 are visible in this plot 
showing the Local Supercluster. The (obviously static) black and white ver- 
sion shown here does not allow stereoscopic depth to be perceived, nor the 
progressive build up of MSTs. 



The outcomes of the MST analyses are shown in Tables 1 and 2. The ten 
rows in those tables show that the procedure described above was run ten times, 
for ever larger samples, with distance limits (in redshift space) ranging from 
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Table 1. Results of pcrculation analysis for samples of increasing rcdshift. 
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Table 2. Percentage of galaxies included in MSTs. 
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cz < 250 km s _1 to cz < 7500 km s _1 (the full set of downloaded data), as 
indicated in the first column. The second column shows the number of galaxies 
in each of these sample volumes. The third column shows the dramatic decline 
in the density of the galaxies, due to the incompleteness of the data (as discussed 
above). 

The position of the peak of the curve (as shown in Figure 1) varies with 
the sample distance, as reflected in the fourth column of Table 1. Had it not 
been for the incompleteness of the data, the position of the peak would have, 
presumably, not shifted. Note, however, the rapid change in the first four data 
sets, after which the change is more uniform. Coincident with the jump is a 
change in the shape of the actual peak; it is asymmetric for the three sets with 
cz < 1000 km s _1 , favouring the lower values of percolation distance, hence the 
lower values in this column. For the other data sets, the peak is more symmetric 
(much as it appears in Figure 1) and favours the higher values seen in the table. 

Table 2 shows what percentages of the galaxies are included in MSTs run 
to certain percolation distances (to 6.25 km s _1 , to 12.5 km s" 1 etc). In spite 
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Figure 3. The percentage of galaxies included in MSTs varies according to 
both the data set used (horizontal axis) and the maximum allowed percolation 
distance (separate curves). See text for interpretation. 

of the dramatic differences in galaxy density, as one moves (down the table) to 
larger distance limits and sizes of databases, the variations in the percentages 
are relatively mild. 

Figure 3 depicts the variations in graphical form. The eleven curves plot- 
ted show the eleven (maximum) percolations of the MSTs. Their horizontal 
variations show how they decline - from their true values - with increasing in- 
completeness of data. The true values ought to be obtained by extrapolating 
the curves leftward to the zero limit of the sample, but caution needs to be 
exercised. 

First, it can be seen that the lower curves rise steeply upward towards the 
left of the plot. This indicates the presence of a large population of intrinsically 
faint galaxies, only detectable at very low redshifts. It raises the well-known 
debate about the faint end of the luminosity function, with opinions differing as 
to the quantity of galaxies present. Even the lowest of the eleven plots - that for 
the ridiculously low value of 6.25 km s _1 - looks as if it might rise dramatically 
towards the 100% level. However, in the absence of a complete volume limited 
sample, we cannot be sure that any of the lower five curves could or could not 
reach 100%. The only curve, for which we have firm evidence of it touching 
100%, is the next one up, that for a maximum percolation distance of 100 km 
s" 1 . ' 

The results here strongly suggest that every galaxy has a neighbour within 
100 km s" 1 , or possibly less, in redshift space. 

The final column of Table 1 shows the maximum percolation distance that 
the trees reach, at the point where all trees merge into a single entity; in other 
words it is the minimum percolation distance that would unite all galaxies in 
the data set. By contrast to the percentages, but not unexpectedly, it does show 
a large variation - mainly because of the increasing incompleteness of the data 
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as the limiting distance is increased. Again we can display it in graphical form, 
this time in Figure 4. 

To find the true value represented in the figure, we are again faced with an 
extrapolation to the left, and reflecting the experience in the previous figure, the 
line starts to curve as the axis is approached. However, a conservative estimate 
would be to adopt a value of 100 km s^ 1 , both because it is reached with 
the smallest data set, and because it would be indicated by a linear regression 
through the seven lowest data points. 

It also matches the previous value and confirms that all galaxies have neigh- 
bours within 100 km s _1 , or possibly less, and, similarly, such a percolation 
distance serves to interconnect all galaxies, at least those within the same large- 
scale structure. This is an improvement on the working definition originally 
advanced by the first author (F98) for galaxy-galaxy separations within a large- 
scale structure. However, it assumes almost a volume-limited sample. 

Figure 4, however, shows some discontinuity beyond the sample size of 
cz = 3000 km s _1 . This is not surprising as this represents the point where the 
data set expands beyond our local large-scale structure, the Virgo Supercluster. 
It raises the question as to what percolation distance would unite such large- 
scale structures. We have only a handful of such superclusters within cz < 7500 
km s~ x (i.e. Virgo, Centaurus, Perseus-Pisces, Fornax, Sculptor and Coma)to 
go on, but the offset of the line to 700 km s _1 suggests that as an upper limit. In 
short we interpret Figure 4 to say that all superclusters (large-scale structures) 
are interlinked at a percolation distance of 700 km s -1 (10 Mpc) or less. Thus 
all large-scale structures interconnect to form a single labyrinth of high density 
regions, as suggested earlier in the 'bathsponge' analogy. 

These findings are of course based only on data with cz < 7500 km s _1 , 
but hopefully - though not necessarily - they also describe the character of the 
nearby universe. 
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4. Discussion 

This investigation was prompted by the need to improve upon a definition for 
large-scale structures, yet it seems to reveal something more fundamental about 
the distribution of galaxies. 

Every galaxy has a neighbour (within 100 km s^ 1 = 1.4 h^Q Mpc), and that 
neighbour has a neighbour, such that galaxies are arrayed in filaments, or tree 
structures (as seen in Figure 2). This is also apparent in Figure 3 of Karachentsev 
et al. (these proceedings) and Figure 1 of Koribalski (these proceedings), both of 
whom use samples that include low (optical) luminosity galaxies. Karachentsev 
uses the term the 'Local Cosmic Web', to describe the filamentary structures, 
Koribalski 'a beautiful network of filaments'. Koribalski also states that 'HI 
surveys also significantly enhance the connectivity of large-scale filaments by 
filling in gaps populated by gas-rich galaxies'. From the general observation 
that 'slices of the universe' reveal a 'bubbly' texture (Koribalski mentions 'loops') 
the filaments and tree structures surround voids (much like the filaments in the 
shells of supernova remnants). On a larger scale 2MASS galaxies (Huchra, these 
proceedings) portray the 'Cosmic web' to a greater depth. 

The finding that galaxies are confined to filaments and trees extends the 
earlier finding that nearby galaxies were predominantly members of groups (see 
Burstein (Galaxy Groups), these proceedings, discussing Brent Tully's Nearby 
Galaxy Atlas). Similarly, in the open discussion at this conference, consensus 
emerged that there were no such things as 'field galaxies'. All galaxies are 
somehow connected to some structure or other. 

Clearly this tells us something fundamental about how and where galaxies 
were formed. However, since their formation gravity will have modified their 
distribution. In the same way that gravity has arrested and reversed the cosmo- 
logical expansion of our Local Group (such that the Great Galaxy in Andromeda 
and our Galaxy are now drawn towards one another) so the same must hold for 
similar sized groups. This is apparent from the peaking of the histograms (Fig- 
ure 1) showing how gravity has fragmented the distribution into a great number 
of groups (minimal spanning trees). 

The finding is generally in line with the overall scenario that large-scale 
structures formed from the condensation of overdensities; it may, however, sug- 
gest that galaxies might have formed by some sort of chain reaction. If current 
n-body simulations (e.g. Pearce et al 2001) accurately portray the formation of 
large-scale structures (if not individual galaxies), then they too should exhibit 
similar percolation properties to those found here. This will provide the basis 
for further investigations. 
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